Search CORE

183 research outputs found

Web page scoring systems for horizontal and vertical search

Author: Marco Gori
Marco Maggini
Michelangelo Diligenti
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2004
Field of study

Neural computation, social networks, and topological spectra

Author: Diligenti Michelangelo
Gori Marco
Maggini Marco
Publication venue: Published by Elsevier B.V.
Publication date
Field of study

AbstractThis paper emphasizes some intriguing links between neural computation on graphical domains and social networks, like those used in nowadays search engines to score the page authority. It is pointed out that the introduction of web domains creates a unified mathematical framework for these computational schemes. It is shown that one of the major limitations of currently used connectionist models, namely their scarce ability to capture the topological features of patterns, can be effectively faced by computing the node rank according to social-based computation, like Google's PageRank. The main contribution of the paper is the introduction of a novel graph spectral notion, which can be naturally used for the graph isomorphism problem. In particular, a class of graphs is introduced for which the problem is proven to be polynomial. It is also pointed out that the derived spectral representations can be nicely combined with learning, thus opening the doors to many applications typically faced within the framework of neural computation

Elsevier - Publisher Connector

Cluster Generation and Cluster Labelling for Web Snippets: A Fast and Accurate Hierarchical Solution

Author: Geraci Filippo
Maggini Marco
Pellegrini Marco
Sebastiani Fabrizio
Publication venue
Publication date: 01/01/2006
Field of study

This paper describes Armil, a meta-search engine that groups into disjoint labelled clusters the Web snippets returned by auxiliary search engines. The cluster labels generated by Armil provide the user with a compact guide to assessing the relevance of each cluster to her information need. Strik- ing the right balance between running time and cluster well- formedness was a key point in the design of our system. Both the clustering and the labelling tasks are performed on the ?y by processing only the snippets provided by the auxil- iary search engines, and use no external sources of knowl- edge. Clustering is performed by means of a fast version of the furthest-point-?rst algorithm for metric k-center cluster- ing. Cluster labelling is achieved by combining intra-cluster and inter-cluster term extraction based on a variant of the information gain measure. We have tested the clustering ef- fectiveness of Armil against Vivisimo, the de facto industrial standard in Web snippet clustering, using as benchmark a comprehensive set of snippets obtained from the Open Di- rectory Project hierarchy. According to two widely accepted external\u27 metrics of clustering quality, Armil achieves bet- ter performance levels by 10%. We also report the results of a thorough user evaluation of both the clustering and the cluster labelling algorithms. On a standard 1GHz ma- chine, Armil performs clustering and labelling altogether in less than one second

Archivio della Ricerca - Università degli Studi di Siena

PUblication MAnagement

Vulgaris: Analysis of a Corpus for Middle-Age Varieties of Italian Language

Author: Andrea Zugarini
Marco Maggini
Matteo Tiezzi
Publication venue: International Committee on Computational Linguistics (ICCL)
Publication date: 01/01/2020
Field of study

Italian is a Romance language that has its roots in Vulgar Latin. The birth of the modern Italian started in Tuscany around the 14th century, and it is mainly attributed to the works of Dante Alighieri, Francesco Petrarca and Giovanni Boccaccio, who are among the most acclaimed authors of the medieval age in Tuscany. However, Italy has been characterized by a high variety of dialects, which are often loosely related to each other, due to the past fragmentation of the territory. Italian has absorbed influences from many of these dialects, as also from other languages due to dominion of portions of the country by other nations, such as Spain and France. In this work we present Vulgaris, a project aimed at studying a corpus of Italian textual resources from authors of different regions, ranging in a time period between 1200 and 1600. Each composition is associated to its author, and authors are also grouped in families, i.e. sharing similar stylistic/chronological characteristics. Hence, the dataset is not only a valuable resource for studying the diachronic evolution of Italian and the differences between its dialects, but it is also useful to investigate stylistic aspects between single authors. We provide a detailed statistical analysis of the data, and a corpus-driven study in dialectology and diachronic varieties

Archivio della Ricerca - Università degli Studi di Siena

Controllo Coordinato con Sensori Acustici di Robot Subacquei

Author: Maggini Marco
Publication venue: 'Pisa University Press'
Publication date: 01/03/2005
Field of study

Il lavoro presentato nasce come proposta di soluzione alternativa in seno al progetto denominato SITAR, in merito all'individuazione di siti inquinanti sotto il fondo marino. Obiettivo della tesi svolta è l'implementazione di un sistema di controllo per due robot subacquei, semiautonomi e cooperanti, mediante l'utilizzo di sensori acustici. Oltre allo sviluppo del controllore viene analizzata quale sia l'influenza della frequenza del segnale di comunicazione fra i veicoli sulle problematiche di controllo

Electronic Thesis and Dissertation Archive - Università di Pisa

Relational Neural Machines

Author: Diligenti Michelangelo
Giannini Francesco
Gori Marco
Maggini Marco
Marra Giuseppe
Publication venue
Publication date: 01/01/2020
Field of study

Deep learning has been shown to achieve impressive results in several tasks where a large amount of training data is available. However, deep learning solely focuses on the accuracy of the predictions, neglecting the reasoning process leading to a decision, which is a major issue in life-critical applications. Probabilistic logic reasoning allows to exploit both statistical regularities and specific domain expertise to perform reasoning under uncertainty, but its scalability and brittle integration with the layers processing the sensory data have greatly limited its applications. For these reasons, combining deep architectures and probabilistic logic reasoning is a fundamental goal towards the development of intelligent agents operating in complex environments. This paper presents Relational Neural Machines, a novel framework allowing to jointly train the parameters of the learners and of a First--Order Logic based reasoner. A Relational Neural Machine is able to recover both classical learning from supervised data in case of pure sub-symbolic learning, and Markov Logic Networks in case of pure symbolic reasoning, while allowing to jointly train and perform inference in hybrid learning tasks. Proper algorithmic solutions are devised to make learning and inference tractable in large-scale problems. The experiments show promising results in different relational tasks

arXiv.org e-Print Archive

Archivio della Ricerca - Università degli Studi di Siena

Focus of Attention Improves Information Transfer in Visual Features

Author: Betti Alessandro
Gori Marco
Maggini Marco
Melacci Stefano
Tiezzi Matteo
Publication venue
Publication date: 01/01/2020
Field of study

Unsupervised learning from continuous visual streams is a challenging problem that cannot be naturally and efficiently managed in the classic batch-mode setting of computation. The information stream must be carefully processed accordingly to an appropriate spatio-temporal distribution of the visual data, while most approaches of learning commonly assume uniform probability density. In this paper we focus on unsupervised learning for transferring visual information in a truly online setting by using a computational model that is inspired to the principle of least action in physics. The maximization of the mutual information is carried out by a temporal process which yields online estimation of the entropy terms. The model, which is based on second-order differential equations, maximizes the information transfer from the input to a discrete space of symbols related to the visual features of the input, whose computation is supported by hidden neurons. In order to better structure the input probability distribution, we use a human-like focus of attention model that, coherently with the information maximization model, is also based on second-order differential equations. We provide experimental results to support the theory by showing that the spatio-temporal filtering induced by the focus of attention allows the system to globally transfer more information from the input stream over the focused areas and, in some contexts, over the whole frames with respect to the unfiltered case that yields uniform probability distributions

arXiv.org e-Print Archive

Archivio della Ricerca - Università degli Studi di Siena

INRIA a CCSD electronic archive server